CDS

Accession Number TCMCG075C14351
gbkey CDS
Protein Id XP_007035053.2
Location complement(join(28921925..28922068,28922566..28922616,28923072..28923117,28923826..28923875,28923958..28924001,28924088..28924287,28924379..28924449,28924525..28924614,28924705..28924836,28925057..28925107,28925955..28926044,28926654..28926718,28926922..28927042,28927283..28927491,28927699..28927801,28927897..28927962,28928068..28928127,28928219..28928328,28928441..28928513,28928591..28928593))
Gene LOC18603175
GeneID 18603175
Organism Theobroma cacao

Protein

Length 592aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007034991.2
Definition PREDICTED: actin-related protein 9 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category Z
Description Belongs to the actin family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03036        [VIEW IN KEGG]
KEGG_ko ko:K11673        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGATTATTTGAAAACTGTTGTCCCTTCTCAGCTCCTCTCCGAACGTGGCTCCAATCTCGTCGTCATCAACCCCGGCTCTGCAAATATAAGAGTAGGGTTAGCTAAGCAGGACTCTCCTTTCATCGTTCCTCATTGCATTGCTCGCCGAACCACCCAATTCTCCAAGTTAAATGTTCAAGATCAGTTGCTTAATTCTCAACTTACCACAGCGCAGCACATGGAGCGCGAAAAGGCTTATGATGTTATTGCGTCATTGTTGAAGATACCTTTCCTTGATGAAGAGGTTGCCAATAGTTCTGTTCCACGGAAGATGGGACGTGTTGATGGATATAATCTTCAGAATACCAGGAAGGATGTAGCCTTCACTTGGACTGATATACATGTGAAGGACATACATTCATCAGTGGCACCAGAAAGTTCAATGGATAAAAGTTTCATAAATGAGTCCTTGGTCCAACATGAAGGTACTGATTCAAAGGAACCTACTTTGACCAAACGCAAGTTCAGGGCGGTCATATGTGGTGAGGAAGCCCTAAGGATATCTCCCACTGAGCCATATTGCTTACGTCGTCCTATTCGTAGAGGTCACCTAAATATTTCACAACATTATCCCATGCAGCAGGTCCTTGAAGATCTGCACGCTCTATGGGACTGGATTTTGTCAGACAAACTGCATATCTCTCACCAAGAAAGGAGCTTATATTCTGCTATTCTTGTTGTGCCAGAAACATTTGATAATCGTGAGATAAAGGAGATCTTATCTATTTTACTGCGAGACTTGTGCTTTAGCTCAGCAGTGGTACACCAGGAAGGCTTGGCAGCAGTTTTTGGGAATGGTTTATCAACGGCATGTGTTGTAAATATGGGTGCGCAAGTGACATCGGTCATTTGCATTGAGGATGGAGTGGCTCTACCTAATACAGAGAAGACTTTACCCTTTGGTGGAGAGGATATATCAAGATGTCTTCTTTGGACTCAGAGGCATCATCAGACATGGCCACAAATTCGTACCGACATTTTGACAAAGCCTATAGATCTATTGATGCTGAACAGGCTAAAAGTGTCCTACTGTGAAATTAAGGAGGGTGAACTTGATGCTATAGCTGTAGTTCATTCTTATGAGGATGCAATGCCTCCTGGATCTCATAAGACAAGGCTAACTGCTCTGAACGTTCCTCCTATGGGTTTGTTCTACCCAACACTTTTGATTCCTGATTTGTATCCTCCACCACCTCGTTCTTGGTTTCATGACTATGAAGATATGCTGGAAGATACATGGCATGTTGAATTCCCAAGAAGACCTGACATGCCAGATGGTCTATATCCTGGAATTAATGTTGGGTTACCAATGTGGGATAACTACCCAATTTTTTCTATGAAACCAAAGAAAGAAGAGAAGGTTGGCCTAGCAGAAGCCATAACCAGTAGCATTCTTTCAACTGGTCGCATAGACCTGCAACGAAAATTGTTTTGTAGCATACAGTTGATTGGTGGAGTGGCTTTGACTGGTGGGCTAATTCCTGCTGTGGAGGAGAGAGTTTTACATGCCATTCCTTCAAATGAAGCAATCGATACTGTTGAGGTTTTGCAATCAAGAACGAATCCAACTTTTGTGTCTTGGAAGGGCGGAGCCATCCTTGGTGTTCTAGATTTTGGTCGGGATGCTTGGATACATCGAGAGGACTGGACCCGCAATGGGATTCACATTGGGAGTGGCAGGAAATACAAGGATTCTTATTTCCTTCAAGCACAGGCAATGTGTTACATCAATTCCTAG
Protein:  
MDYLKTVVPSQLLSERGSNLVVINPGSANIRVGLAKQDSPFIVPHCIARRTTQFSKLNVQDQLLNSQLTTAQHMEREKAYDVIASLLKIPFLDEEVANSSVPRKMGRVDGYNLQNTRKDVAFTWTDIHVKDIHSSVAPESSMDKSFINESLVQHEGTDSKEPTLTKRKFRAVICGEEALRISPTEPYCLRRPIRRGHLNISQHYPMQQVLEDLHALWDWILSDKLHISHQERSLYSAILVVPETFDNREIKEILSILLRDLCFSSAVVHQEGLAAVFGNGLSTACVVNMGAQVTSVICIEDGVALPNTEKTLPFGGEDISRCLLWTQRHHQTWPQIRTDILTKPIDLLMLNRLKVSYCEIKEGELDAIAVVHSYEDAMPPGSHKTRLTALNVPPMGLFYPTLLIPDLYPPPPRSWFHDYEDMLEDTWHVEFPRRPDMPDGLYPGINVGLPMWDNYPIFSMKPKKEEKVGLAEAITSSILSTGRIDLQRKLFCSIQLIGGVALTGGLIPAVEERVLHAIPSNEAIDTVEVLQSRTNPTFVSWKGGAILGVLDFGRDAWIHREDWTRNGIHIGSGRKYKDSYFLQAQAMCYINS